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Abstract: In the 7 years since the first publications describing phage-displayed peptide libraries, phage display has been 
successfully employed in a variety of research. Innovations in vector design and methods to identify target clones account for 
much of this success. At the same time, not all ventures have been entirely successful and it appears that phage and host 
biology play important roles in this. A key issue concerns the role played by a displayed peptide or protein in its successful 
expression and incorporation into virions. While few studies have examined these issues specifically in context of phage 
display, the literature as a whole provides insight. Accordingly, we review phage biology, relevant aspects of host biology, 
and phage display applications with the goals of illustrating (/) relevant aspects of the interplay between phage-host biology 
and successful phage display and (//) the limitations and considerable potential of this important technology. 

Key worc/s: bacteriophage Ml 3, phage display, pill, pVIII, expression libraries. 

Resume : Au couis dcs 7 annccs qui oni suivi Ics premieres publications dccrivant dcs banqucs de peptides exposes a la 
surface dc phages, ccl afllchagc d\in phage a etc utilise avec succcs dans divers types dc recherche, Des innovations dans la 
mise au poin! dc vccteurs el dc methodcs d'idcniification de clones-ciblcs expliquent une bonne partie de ce succes. Toutes 
Ics initiatives n'onl ccpcndant pas etc couronnccs dc succcs car ta biologic du phage ct dc son bote jouerait un role 
important. Un dc\ cIoppcmcnt crucial consiste a clucidcr !c role jouc par une protcinc ou un peptide expose a la surface du 
phage dans son incor|>oration ct son expression efficace dans les virions. Mcmc si pcu d't5tudcs ont portc sur les applications 
de rafllchagc d'un phage, rcnscmblc dc la lillcrature propose des suggestions intercssanles. Dans cettc perspective, nous 
passons en revue la biologic des phages, Ics aspects pertinents dc la biologic de I'hote ct les applications de Taffichage du 
phage en vue d' identifier Ics points importants de 1' interaction hote-phagc et dc la rcussite de raffichagc du phage. Nous 
voulons aussi illusirer le potenticl considerable et aussi les limites de cette technologic importante. 

j\fofs dcs : bacteriophage iM13, affichage d'un bacteriophage, pIH, pVIlI, banques d'expression. 
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Introduction 

The publication in 1990 of three studies describing peptide 
display on the surface of a filamentous bacteriophage brought 
to the attention of many scientists a simple and powerful ap- 
proach to identifying or optimizing ligands for antibodies and 
other biomolecules. In model studies, three groups of re- 
searchers had constructed vast libraries of random sequence 
peptides fused to the N-terminus of a minor coat protein (pIII) 
of a filamentous phage (Fig. 1) and by simple affinity-selection 
methods, identified novel sequences that bound specifically to 
cognate antibodies (Scott and Smith 1990; Cwirla et al. 1990) 
or streptavidin (Devlin et al. 1990). 

Several aspects of phage biology made these studies possi- 
ble and the concept powerful. Because large numbers of 
virions occupy a small volume (»10'2 particles-mL"*), con- 
struction of libraries of the required diversity (>10^ unique 
clones; by cloning degenerate oligonucleotides into glll;^ 
Fig. IC) and their subsequent employment was achievable. 
Display of peptides in the simple physicochemical context of 
a structurally simple virion (Fig. IB) allowed for successful 
enrichment with minimal background. Importantly, affinity- 
selection of a target peptide served to clone its encoding DNA 
(Fig. ID) because the peptide and DNA were physically 

' Several conventions exist for identifying Ff phage genes and 
their products. Commonly, phage genes are identified with 
Roman numerals (e.g., gene III, gene VJII), while their encoded 
proteins are identified with Roman numerals prefixed with the 
letter p (e.g., pIII, pVIII). For clarity and convenience, we 
identify genes and their products with the prefixes g and p, 
respectively (e.g., glll, gVIII; pIII, pVIII). 



linked. 

Phage display soon found a diversity of applications. 
Among early studies were pIII display of a properly folded 
human growth hormone (Bass et al. 1990) and of single-chain 
Fv anfibodies (McCafferty et al. 1 990), as well as both pIII and 
pVIII (the major coat protein) d isplay of Fab fragments (Bar- 
Das et al. 1 i ; i^^ang et al. l yy l ). Affinity-selection other than 
with antibodies was soon demonstrated in a study called "di- 
rected evolution of a protein," which employed a library of 
variants of bovine pancreatic trypsin inhibitor (BPTI) to iden- 
tify a peptide possessing > 10^- fold greater affinity for human 
neutrophil elastase than the starting peptide (Roberts et al 
1992a). 

The value of phage display is reflected in the large number 
of published reviews. These include general overviews (Wells 
and Lowman 1992; Barbas 1993; Smith and Scott 1993; 
Clackson and Wells 1994; Bradbury and Cattaneo 1995; Per- 
ham et al. 1995; Smith and Petrenko 1997) and reviews of 
random peptide libraries (Scott 1992; Lane and Stephen 1993; 
Scott and Craig 1994; Cortese et al. 1994, 1995, 1996; Burritt 
et al. 1996) and display of proteins (Dunn 1996), as well as 
more focussed reviews concerning affinity-maturation themes 
(O'Neil and Hoess 1995), display of constrained peptides 
(Ladner 1995), antibody display (e.g., Marks et al. 1992; Win- 
ter et al. 1994), and the use of zinc finger libraries (Choo and 
Klug 1995). 

Notwithstanding the successful employment of phage dis- 
play in a variety of research problems, not all applications have 
been entirely successful and it appears that phage and host 
biology play important roles in this. Expectedly, as with pro- 
tein expression generally, a key issue concerns the ways in 
which a displayed peptide or protein influences its ability to 
be expressed and incorporated into virions. Although few stud- 
ies have examined these issues specifically in context of phage 
display, the literature as a whole provides insight. Accord- 
ingly, we review relevant aspects of phage and host biology 
and phage display applications with the goals of (/) shedding 
light on the ways in which phage and host biology can interfere 
with phage display and (//) illustrating both the limitations and 
considerable potential of this important technology. 

Ff phage biology 

Overview 

Ml 3, fl, and fd (reviewed^ in Rasched and Oberer 1986; 
Model and Russel 1988; Russel 1991, 1995) are collectively 
called the Ff phage because of their filamentous appearance 
and almost total dependence (Russel et al. 1988) on the F-pilus 
for infection of their Escherichia coli host. The long (900 nm) 
and narrow (7 nm) Ff virion (Fig. lA) consists of a single- 
stranded (ss) DNA genome packaged in a tube comprised of 
2700 co pies of the major coat protein pVIII and closed at the 
ends by four or five copies of eacn of tour Species of minor 
coat proteins, including pIII (Fig, IB). Unlike lytic phage, 
which are released by cell lysis after assembly in the host cell 
cytoplas m, Ff phage are continuouslv extruded through the h ost 
j:ell envelope m a process that couples assembly with export. — " 
Injection begins when pIII adsorbs to the tip of a host cem 



^ Except where noted, material presented in this section is derived 
from the cited reviews. 
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Fig. 1, Wild-type Ff phage and multivalent pIII phage display. (A and B) Wild-type filamentous phage that infect F-piliated£. coli(¥f 
phage). (A) Ff phage are narrow flexible rods of the indicated dimensions. (B) Cartoon illustrates inherent simplicity of virion structure: a 
single-stranded circular genome is surrounded by -2700 copies of the major coat protein pVIII and 4 or 5 copies of each of four species of 
minor coat proteins, including pIII, which binds to a host ceil F-pilus. For a model derived from X-ray fibre diffraction and other data, the 
interested reader is directed to Marvin et al. (1994). (C) A number of multivalent pIII phage display vectors are derivatives of fd-tet (Zacher 
et al. 1980) and accordingly possess a tetracycline-resistance gene (Tet*^) inserted into the intergenic (I.G.) region. (C and D) Inserting foreign 
DNA into glll at a point corresponding to the pIII N-terminal signal peptidase cleavage site results in the display of a foreign peptide as an 
N-terminal fusion to each mature pIII molecule. (D) As described in the text, a key feature of phage display libraries is that an affinity- 
selectable target peptide and its encoding DNA are packaged together. 




F-pilus. After the virion is brought to the cell surface and the 
ss genome (or (+) strand) is delivered to the cytoplasm, host 
polymerases employ the (+) strand as a template for synthesis 
of a complementary (-) strand, yielding a double-stranded (or 
replicative form (RF)) phage genome. Synthesis of the (-) 
strand begins at on(-), one of several sites within the inter- 
genic region. Because of the double-stranded nature of two 
adjacent stem loops, on(-) resembles the -35 and -10 regions 
of RNA promoters; the resemblance is important for primer 
synthesis by RNA polymerase (Higashitani et al. 1997). Sur- 
prisingly, mutants with defects in on(-) are viable, including 



M13 mutants with deletions in on(-) (Kim et al. 1981) and 
fd-tet, a vector constructed by splicing a tetracycline-resistance 
gene into on{~) (Zacher et al, 1980; Smith 1988) at a position 
that interrupts one of the stem loops. Initiation of (-) strand 
replication is delayed in these mutants and proceeds slowly by 
an unidentified mechanism (Kim et al. 1981; Smith 1988; Hi- 
gashitani et al. 1992). 

Following (-) strand synthesis, host-mediated processes 
lead to producdon of coat (pIII, pVI, pVII, pVIII, pIX), assem- 
bly and export (pl and pIV), and phage replication (pll, pV, 
pX) proteins, the levels of which are determined mainly by a 
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conceptually simple mechanism. Thus, the genome is organ- 
ized into two coding regions, one for proteins required in large 
quantities (e.g., pVIII) and possessing a strong initial promoter 
and the other for proteins required in fewer numbers (e.g., pIII) 
and possessing a less efficient initial promoter. Multiple pro- 
moters within each region lead to more frequent transcription 
of 3' versus 5' genes. Host-mediated degradation of mRNA 
transcripts from the 5' end magnifies this cascade effect, so 
that a finely tuned balance of synthesis of phage proteins is 
achieved with little apparent feedback control (but see Fulford 
and Model (1988^7, 1988 A) for regulation of (-) strand synthe- 
sis by pll, pX, and pV). Prima facie, this balance would appear 
to be sensitive to perturbation. 

Phage replication is initiated by pll, which nicks the (+) 
strand at on(+) in the intergenic region, allowing the freed 3' 
end to serve as a primer for rolling circle replication on the (-) 
strand template. As each nascent (+) strand is completed, pll 
ligates the molecular ends to form a closed circle. Early in 
infection, new (+) strands serve as templates for additional RF 
molecules as described earlier, while later in infection, ss-to- 
RF conversion is arrested as nascent (+) strands become se- 
questered by increasing levels of the ss DNA binding protein 

pv. 

^ Virion assembly and export require the host cell mem- 
branes. Following synthesis, all five coat proteins are inserted 
into the inne^ membrane ^rFnHpmann and Model 1995). Al- 
though pill and pVllI are synthesized with N-terminal signal 
peptides that are cleaved upon membrane insertion, they re- 
main anchored in the membrane by C-terminal hydrophobic 
regions wi th their N-t ennini in rhp pp|-jplac:m^hAr^nc mem- 
brane insertion of pill may be Sec dependant because of its 
relatively large size (for size as a detemiinant of Sec depend- 
ence, see Pugsley (1993)) and recombinant pIII appears to 
require Sec functions (Peters et al. 1994), pVIII insertion is 
regarded as Sec independent (Pugsley 1993; Kuhn 1995; Mur- 
phy and Beckwith 1996). In current models, pi and pIV form\ 
a gated multimeric export channel spanning the inner and outer ) 
membranes. Initiation of assembly and export involves interV 
actions among a packaging signal within the intergenic region, 
the pl-pl V complex, host thioredoxin, and minor coat proteins 
pVII and pIX. In a process involving ATP hydrolysis and 
proton motive force (Feng et al. 1997), nascent virions arA 
extruded through the export channel with the concomitant re- 
placement of pV by pVlH. Addition of pVI and pIII to the end / 
of an extruding virion completes its assembly. / 

pIII 

pIII is required not only for F-pilus adsorption but also for 
terminating virion assembly and stabilizing the viral particle. 
Delays in supplying pIII lead to production of multilength viral 
particles (polyphage) containing two or more unit-length 
phage genomes (Rasched and Oberer 1986; Model and Russel 
1988); even wild-type Ff phage routinely produce about 5% 
polyphage. The knob-on-stem appearance of pIII in electron 
micrographs reflects its two functional domains. The N-terminal 
two-thirds of the molecule mediates virion adsorption while 
the C-temiinal one-third functions in virion assembly and 
structure (Armstrong et al. 1981; Crissman and Smith 1984; 
Model and Russel 1988). Thus mutants producing pIII that 
lacks a substantial portion of the N-terminal domain produce 
normal numbers of noninfectious unit-length virions (mono- 



phage; Crissman and Smith 1984), but mutants unable to pro- 
duce at least the C-terminal portion of pIII are host lethal or 
(in an fd-tet background, described later) produce relatively 
less stable noninfectious polyphage (Pratt et al. 1966; Criss- 
man and Smith 1984). 

Many of the membrane-associated phenotypes exhibited by 
Ff phage infected cells, such as increased detergent sensitivity 
and leakage of periplasmic proteins, are mediated by pIII (Nel- 
son et al. 1981; Boeke et al. 1982; Rampf et al. 1991). 

Early phage display 

The first phage display library 

The first phage display library (PDL) was constructed by 
Smith (1985), who cloned £coRI gene fragments into glll of 
phage fl, creating a library of virions displaying peptides be- 
tween the pIII N- and C-terminal domains. Although identifi- 
cation of a clone reactive with a-EcoRI antibodies demonstrated 
the considerable promise of this novel approach. Smith noted 
several problems related to phage biology. The identified 
clone produced small, barely detectable plaques and increased 
numbers of polyphage. Infectivity was reduced 25-fold and 
loss or alteration of the EcoKl insert conferred a marked se- 
lective advantage during virion propagation. 

fd-tet 

Ff mutations that block phage assembly while allowing DNA 
replication to continue are normally lethal to the host, possibly 
because of intracellular accumulation of phage DNA and gene 
products (Pratt etal. 1966; Smith 1988; Smith and Scott 1993). 
Because of its o/7(-) defect, however, the RF copy number of 
fd-tet is reduced about 13-fold and otherwise lethal (in a wild- 
type background) morphogenetically defective mutants of fd- 
tet are thus viable (Crissman and Smith 1984; Smith 1988; 
Smith and Scott 1993). Parmley and Smith (1988) exploited 
this property of fd-tet in designing frameshift vectors, de- 
scribed later. 

Compared to wild-type Ff, fd-tet plaques are extremely 
small and turbid; virion production is reduced fourfold, and 
yields of RF and ss DNA are reduced more than 10-fold (Smith 
1988). Ml 3 ori{-) mutants have similar plaque phenotypes 
and reduced virion yields (Kim et al. 1981). Infectivity of 
fd-tet (infectious units/physical particle) is reduced to as little 
as 2.5% of that of wild-type Ff (Smith 1988), although infec- 
tivity of an fd-tet derived vector was found to be 50% of 
wild-type levels (Parmley and Smith 1988). 

SOS response induction 

Relatively recently, Ml 3 mutants with ori{-) defects have 
been shown to induce the £. coli SOS response (Higashitani 
et al. 1992, 1995), presumably because of delayed (-) strand 
synthesis or related events that expose the cell to ss DNA, the 
apparent SOS system inducer (Walker 1996). The phenotypic 
similarities of M13 ori{~) mutants and fd-tet suggest the latter 
may also induce the SOS response. Generally, SOS induction 
leads to increased spontaneous mutagenesis (Hutchinson 
1996), such as in certain base substitution mutations (Miller 
and Low 1984; Yatagai et al. 1991a, \99\b\ as well as in- 
creased induction of recA and other recombination-related 
genes (Lloyd and Low 1996; Walker 1996) and a consequent 
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increased frequency of homologous recombination (Abbot 
1985; Dolzani et al. 1991; Mudgett 1991). Given that many 
phage display vectors have been derived from fd-tet (Smith 
and Scott 1993; Bonnycastle et al. 1996), SOS response induc- 
tion by this vector merits investigation. 

The first biopanning 

Continuing Smith's work, Parmley and Smith (1988) showed 
that virions displaying peptides of interest could be affinity- 
selected (or biopanned) from a mock library containing a huge 
(10^-fold) excess of virions lacking the target peptide. Several 
improvements, reflecting a consideration of phage biology, 
were made in vector design. Displayed peptides were moved 
from the earlier mid-pIII site to the N-terminus of mature pIII, 
allowing for greater conformational freedom and reducing po- 
tential interference with the adsorption function of pIIL As 
well, fd-tet (versus wild-type fl) was employed as a parent for 
new vectors, with the hope that the reduced gene expression 
and virion production of fd-tet (Smith 1988) 'might diminish 
the host toxicity of some difficult to export peptides (Smith 
and Scott 1993). Finally, knowing that plll-deficient mutants 
of fd-tet were viable, Parmley and Smith (1988) developed a 
frameshift vector (fUSEl) in which a frameshift at the gill 
cloning site prevented production of pIII and thus of infectious 
virions, unless a foreign insert restored the gill reading frame. 
Here the goal was to allow construction of libraries in which 
each infectious virion was a recombinant, a clear advantage in 
eliminating possible overgrowth of nonrecombinants during 
library propagation and in assessing cloning success and the 
completeness of a libraiy. 

Some problems remained, particularly with fUSEl recom- 
binants bearing larger inserts. Thus, 54- and 71 -bp inserts re- 
duced infectivity by 50%, and a 335-bp insert reduced 
infectivity 20-fold. Importantly, a 20-bp insert had no apparent 
effect on infectivity. 

The first random peptide libraries 

Three pioneering random peptide library (RPL) applications 
(Cwiria et al. 1990; Devlin et al. 1990; Scott and Smith 1990) 
validated the proposal of Parmley and Smith (1988) that large 
libraries of phage-displayed peptides could be used to identify 
epitopes, mimotopes (Geysen et al. 1987), or other mimetic 
peptides. To construct their >IO^-member hexapeptide RPL, 
Scott and Smith ( 1 990) designed both their vector (fUSE5) and 
inserts in a way that reflected a consideration of phage and host 
biology. To minimize problems in signal peptidase processing 
of recombinant pIII, the first two residues following the cleav- 
age site were retained (with Asp substituting for the normal 
Glu in Ala-Glu) in their clones; these were followed by a 
random hexapeptide flanked by Gly-Ala-rich linker se- 
quences. Other features of fUSE5, including a glll frameshift, 
ser\*ed to exclude nonrecombinants from their library. Cwiria 
et al. (1990) also used an fd-tet derived frameshift vector to 
construct their hexapeptide RPL; here, however, random pep- 
tides were inserted immediately following the signal peptide 
cleavage site. 

H stc libi I gy 

From the notable successes of these RPL studies, it seemed 
evident that phage-host biology did not interfere with the dis- 



play of small plll-displayed peptides, at least in ways that were 
of practical consequence. The previously reviewed successes 
with display of proteins, such as human growth hormone, bo- 
vine pancreatic trypsin inhibitor variants, and antibodies, 
tended to support this idea. Yet, as expected from the earliest 
studies and anticipated by Parmley and Smith (1988), a review 
of the issues involved should predict that many applications 
may not succeed for the reason that, as with recombinant tech- 
nology generally, Ff phage display requires host cell processes 
for both synthesis and export of recombinant pIII or pVIII 
molecules. 

Intuitively, we expect diverse proteins to make varying de- 
mands on the export appartatus and indeed, a body of experi- 
mental data (Pugsley 1993) suggest that proteins vary in their 
requirements (e.g., for specific components of the Sec appara- 
tus; for membrane potential) for efficient export. Accordingly, 
members of a compositionally diverse PDL may be subjected 
to different constraints, leading to varying degrees of success 
in display of member peptides. 

Signal peptides and the N-terminal mature region 

Importantly, phage display involves fusions to or insertions 
near the N-termini of mature pIII or pVIII, a region which in 
proteins is generally believed to play a role in export. Indeed, 
virtually all RPLs display peptides within a postulated N- 
terminal "critical 30-residue export initiation domain" (An- 
dersson and von Heijne 1991; Nielsen et al. 1997; also see 
Summers and Knowles 1989). 

Both pin and pVIII preproteins possess N-terminal signal 
peptides that are cleaved upon membrane insertion and much 
of our understanding of this process has derived from studies 
employing pVIII (Model and Russel 1988; Pugsley 1993). 
Both processing by signal peptidase and subsequent virion 
production are affected in some pVIII mutants possessing resi- 
due substitutions within the N-terminal mature region. For 
example, pVIII preprotein cleavage is reduced by an Ala— >Cys 
substitution at position +1 (viz., immediately following the 
cleavage position) and abolished by Ala^Pro or Ala— >Thr 
substitution. A Glu-»Leu or Glu— >Tyr substitution at position 
+2 retards pVlII processing and, in the case of a Glu->Leu 
substitution, affects the kinetics of virion production (Boeke 
et al. 1980; Russel and Model 1981). Although pVIII tolerates 
a number of substitutions at positions +1 to +5 (lannolo et al. 
1995; Williams etal. 1995), certain substitutions (most notably 
by Tyr) can reduce virion titers by >10'^-foid (lannolo et al. 
1995). 

Early difficulties (e.g., Greenwood et al. 1991; Felici et al. 
1 99 1 ; see also Perham et al. 1 995) in p VIII display of peptides 
larger than six amino acids raised the possibility of a size or 
other restriction related to virion assembly (e.g., steric inter- 
ference in pVlII-pVIII interactions) or export through the 
pI-pIV channel and there is evidence (lannolo et al. 1995; 
Williams et al. 1995) that such restrictions exist. On the other 
hand, X-ray diffraction studies and model building (Malik et 
al. 1996) have indicated that pVlII is able to display larger 
structures than accomplished thus far in vivo. Importantly, 
related work (Malik et al. 1996) found that the success of pVIII 
display correlated better with efficiency of preprotein process- 
ing by signal peptidase than with the sizes of displayed pep- 
tides. 

Study of the N-terminal mature region of recombinant pIII 
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is limited to a single report (Peters et al. 1994) where it was 
found that (/) 5% of clones in a hexapeptide RPL yielded 10- 
to 10^-fold fewer virions than a parental (fd-tet) phage, (ii) 
positively charged (Arg) residues within the mature pIII N- 
terminal decamer of a second PDL reduced virion titers in a 
dose-related manner from -10*^ virions-mL'' to as little as 10^ 
virions* mL'*, and {Hi) prlA {secY) suppressors could restore 
virion production of most defective clones 10^- to 10^-fold. 
Charged residues in the N-terminal mature region have been 
shown to hinder translocation of other recombinant proteins 
(e.g., Li et al. 1988; Andersson and von Heijne 199 1; Yamane 
and Mizushima 1988; reviewed in Boyd and Beckwith 1990). 
It has been suggested that (/) these disrupt a net dipole formed 
around a signal peptide's hydrophobic domain and required 
for translocation (Boyd and Beckwith 1990; Izard and Kendall 
1994) and (//) clusters of these residues may, by conforming 
to the positive inside rule (von Heijne 1994; see also Yamane 
and Mizushima 1988) derived from study of membrane pro- 
teins, block membrane translocation of the signal peptide C- 
terminal domain. 

Surveys of prokaryotic signal peptides and flanking se- 
quences (Nielsen et al, 1997; von Heijne 1986) have revealed 
a marked distribution of amino acids around the signal pepti- 
dase cleavage site, including preprotein (-1,-2, ...) and mature 
protein (+1, +2, ...) positions. For example, Pro is relatively 
common in positions -6 to -4 and +2 to +6 but rare"* in posi- 
tions -3 to +1 (Nielsen et al. 1997), Ala is the most common 
(mol% = 38.3) residue at +1, while Asp (16.2%) and Glu 
(15.0%) predominate at +2. Notably, these (Ala-Asp or 
Ala-Glu) residue pairs initiate mature pHI of wild-type Ff 
phage and, by design, of recombinants in the RPLs of Scott 
and Smith (1990) and Devlin et al. (1990) and of several phage 
display vectors (Burritt et al. 1996). In contrast, foreign pep- 
tides are displayed at, rather than a few residues away from, 
the +1 position of pIII in the RPL of Cwirla et al. (1990). The 
evidence above suggests that this latter approach may be prob- 
lematic. 

We examined this issue theoretically following experimen- 
tal work that employed phage libraries displaying peptides 
derived from the BordeteUa pertussis filamentous hemagglu- 
tinin gene (Wilson 1997) by using the neural networks of 
Nielsen et al. (1997) to predict signal peptide recognition and 
cleavage expected from display of the 30 target clones we 
identified. Predictions were based on use of (/) the pIII display 
vectors we employed, in which peptides follow the N-terminal 
spacer sequences Ala-Asp-Gly-Ala-GIy-Ala (vector 
fDRW70, similar to fUSE5 of Scott and Smith (1990)) and 
Ala-Asp-Gly-Pro (fDRWSwn vectors) and (//) a hypothetical 
pIII vector that displays a peptide without a spacer, viz., at the 
N-terminus. High cleavage site scores were reported for all 
fDRW70 constructs. Most fDRWS/;/; constructs scored below 
a default cutoff value (0.49), but only marginally so (>0.43). 
In contrast, half of the constructs in the third hypothetical 
vector yielded scores less than 0.3, with some as low as 0.15; 
moreover, most constructs were predicted to have altered 
cleavage sites. 

Because pIII and pVUI preproteins possess N-terminal (sig- 



Calculated mol% values are derived from data employed in the 
study by Nielsen et al. (1997) and provided by the authors via 
their FTP server at ftp://vinisxbs.dtu.dk/pub/signalp. 



nal peptide) and C-terminal (membrane anchor) hydrophobic 
domains, foreign inserts that hinder signal peptide cleavage 
could effectively convert these preproteins to polytopic mem- 
brane proteins. Given the crude regulation of phage gene ex- 
pression and general lack of feedback control, elevated 
membrane accumulation of pIII (which at normal levels me- 
diates membrane effects) or pVIII would be expected, to the 
detriment of the host. Consistent with this, mutants in which 
pVIlI synthesis continues while phage morphogensis is 
blocked are host lethal and mediate a variety of membrane 
effects (Russel and Model 1981); these include mutants 
blocked in signal peptidase processing (Rasched and Oberer 
1986). 

Other aspects of membrane insertion-translocation 

The amino acid sequence of a protein can influence more than 
signal peptidase activity. For example, the normally Sec- 
independent membrane insertion of pVIII became Sec de- 
pendent when the size of periplasmic loop (transiently formed 
between the signal peptide's hydrophobic core and the C- 
terminal membrane anchor) was increased and in a related way 
Sec dependence of a model membrane protein increased con- 
comitantly with the size of its periplasmic loop (von Heijne 
1994). It has been postulated that only relatively short 
polypeptides are candidates for Sec-independent membrane 
translocation and their small size may actually preclude pro- 
ductive interaction with some Sec components (Pugsley 
1993), In this context, Pugsley (1993) has questioned whether 
a protein can be designed to be both too long to be Sec inde- 
pendent and too short to be Sec dependent. This intriguing idea 
suggests that certain intermediate-sized pVIII recombinants 
may prove to be intrinsically incompetent for export. 

Both negatively and positively charged residues can mark- 
edly influence membrane translocation (von Heijne 1994; Cao 
et al. 1995). For example, increasing the negative charge of 
the periplasmic loop in pVIII and leader peptidase constructs 
resulted in increased dependence on the electrochemical mem- 
brane potential or the Sec apparatus for translocation (Cao et 
al. 1995). Also, inserting four or five Asp and Glu residues at 
the beginning or middle of the periplasmic loop of pVIlI in- 
hibited membrane insertion or leader peptidase processing to 
as little as 18% of wild-type levels (Cao et al. 1995). 

These findings may account for observations (Grihalde et 
al, 1995) concerning a 30-residue RPL in which variable resi- 
dues followed the N-terminal epitope tag DYKDDDDK and 
four other residues (AETA), the epitope tag having been in- 
cluded to allow assessment of N-terminal proteolysis of re- 
combinant pIIL That only 5-10% of virions displayed an intact 
tag and infectivity of pooled library virions was only 0.4% are 
consistent with problems in translocation or processing of the 
charge-rich sequence. In support of this, the neural networks 
mentioned eariier (Nielsen et al. 1997) predict that a pIII pre- 
protein sequence constructed with the DYKDDDDK epitope 
tag lacks a signal peptide.^ 



The prediciton was made using the WWW server 
(http://www.cbs.dtu.dk/) of the Center for Biological Sequence 
Analysis and the preprotein sequence VKKLLFAIPLWPF- 
YSHS-DYKDDDDKAETA, where - indicates the cleavage site 
in wild-type pIII. The sequence yielded a cleavage site score 
below (66% oO the default cutoff value for predicting cleavage. 
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Genomic libraries, membrane anchors, and secondary 

signal sequences 
Genomic PDLs may pose related problems, for a large fraction 
(20%) of chromosomal sequences can act as signal peptides 
(Kaiser et al. 1987). Given similarities among signal peptides, 
signal anchors, and membrane anchors (Maclntyre and Hen- 
ning 1 990; Nielsen et al. 1 997) and that requirements for trans- 
membrane segments include only minimal length ( 1 2 residues) 
and hydrophobicity (Pugsley 1993), then a significant fraction 
of genomic PDL clones may contain motifs that hinder translo- 
cation, anchor pIII, or pVIII in the membrane or act as secon- 
dary signal peptides (Maclntyre and Henning 1990), yielding 
unexpected cleavage. 

Codon usage and context 

Codon usage and translational context in £. coH (reviewed 
briefly in Smith and Smith 1996; Berg and Silva 1997) may 
adversely affect synthesis of recombinant pVIII (--2700 copies 
required/virion) and pIII (less so; 4-5 copies/virion), particu- 
larly for genomic PDLs derived from genomes with G+C con- 
tents substantially different from that of £. coli. Codon usage 
within the B, pertussis ftiaB gene provides an example with 
which we are familiar (Wilson 1997). Analysis of 196 JlmB 
Arg codons shows that the most common Arg codon in E. coli 
(CGU) tends to be avoided in ffiaB, and that JliaB employs 
other Arg codons that are rare in E. coli (Dong et al. 1996). 
These include 15 occurrences of AGG, found to adversely 
affect recombinant gene expression (Kane 1995). Escherichia 
coli avoids not only the AGG codon but also the AG-G triplet 
(- denotes a boundary between codons) for reasons possibly 
related to translation (Smith and Smith 1996; Berg and Silva 
1997). Notably, AG-G accounts for less than 4% of a data set 
of 742 E. coli tetranucleotides (Smith and Smith 1996) but 
38% of our JhaB data set. 

Molecular chaperones and export targeting 

Nascent and completed proteins interact with a variety of 
molecules with roles in folding and export, including trigger 
factor, DnaJ/DnaK and GroEl/GroES, as well as SecB and 
other Sec components (Pugsley 1993; Hartl 1996; Hesterkamp 
and Bukau 1996; Mayhew and Hartl 1996; Murphy and Beck- 
with 1996). Importantly, the attachment of a signal peptide to 
a normally cytoplasmic protein does not necessarily ensure its 
export (e.g., Summers and Knowles 1989; Andersson and von 
Heijne 1991). Rather, a protein may also require certain motifs 
(patterns of hydrophobicity, positively charged and flexible 
polypeptide segments) that allow maintenance by molecular 
chaperones of a partially unfolded export-competent state 
(Maclntyre and Henning 1990; Pugsley 1993; Driessen 1994; 
Mayhew and Hartl 1996; Murphy and Beckwith 1996). 

Although chaperone-assisted phage display of antibodies 
(Lah et al. 1994; Soderlind et al. 1995) and recombinant pro- 
tein export generally (Wall and Pluckthun 1995) have been 
explored (with mixed results), little is understood regarding 
the influence of molecular chaperones on the export of recom- 
binant pIII and pVIII. Possibly, fusion of a small peptide to the 
relatively large pIII (424 residues) has little impact on export 
competence, whereas fusion of a large normally cytoplasmic 
protein to the smaller pVIII (50 residues) may pose greater 
difficulties. 



Proteases 

Protein degradation (Miller 1996) is a highly active compt 
nent of £. coli metabolism, employed to both regulate leve 
of functional proteins and eliminate damaged proteins. In gei 
eral terms, certain alterations tend to increase the rate of de; 
radation of otherwise stable proteins. These include tl 
production of longer or shorter versions of a normal protein < 
amino acid substitutions that lead to exposure of elements th 
in the native protein are inaccessible. It has been suggested th 
since proteins and proteases have coevolved, protein evolutic 
is constrained by a proteolytic machinery effectively designe 
to recognize nonnative elements. In this context, recombina: 
pIII and pVIII may be candidates for degradation. 

As illustrated by later examples, there is evidence that pn 
tein degradation is an issue in phage display. Yet its impo 
tance may be obscured by the not uncommon practice ( 
inferring the sequence of a displayed peptide from that of i 
encoding glll or gVIII insert without corresponding asses 
ment of the fusion products, such as by SDS-PAGE, Westei 
blotting, and N-terminal sequencing (as illustrated in Hart 
al. 1994). 

Impact of phage and host biology on phage display 

Although problems concerning the quantity and quality of e: 
ported proteins can (or must, out of practical necessity) I 
ignored in some applications of recombinant technology, the^ 
matters assume greater importance with phage display, for tl 
utility of a PDL will often derive from unbiased reliable di 
play of all encoded peptides. The sections above describe 
general terms the ways in which phage and host biology mr 
interfere with phage display. The following sections, in n 
viewing applications of phage display, provide a more specif 
sense of the degree to which such interference is an issue ar 
the extent to which it has been examined. 

Applications and innovations in phag 
display 

Random peptide libraries 

Library diversity and bias 

RPLs displaying peptides of 6-38 amino acids have been d. 
scribed. Two common concerns in library construction are i 
diversity or completeness and bias. RPLs are generally coi 
structed with only 32 codons (e.g., of the form NNK or NN^ 
where K = G or T and S = G or C) to reduce a bias inherent i 
the genetic code from a maximum amino acid molar ratio t 
6:1 (e.g., Arg versus Met) to 3:1. A hexapeptide library di 
playing all possible hexapeptides (a complete library) thi 
requires 32*^ (= 10^) unique clones and, assuming a practic; 
upper limit of --10^-10'^ (Smith 1993), libraries of peptidt 
longer than seven residues accordingly risk being incomplet> 
Nevertheless, libraries of relatively long peptides may ha\ 
advantages. For example, assuming that a hexapeptide RPL 
adequate for a given application, a 20-residue library may ii 
crease the effective library diversity, for each 20-residue pe| 
tide contains 1 5 hexapeptides in different flanking sequenc 
(possibly structural) contexts. 

Whether a library needs to be complete depends, in a give 
application, on the contributions to binding of individual res 
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dues within the target peptide. If all residues contribute 
equally, then assured isolation of the single most tightly bind- 
ing clone will require that the library contain all possible 
clones. Where only a few residues contribute to binding, com- 
pleteness matters less; what remains important in this case is 
that there be little or no position-specific bias, such as against 
certain residues in the residues flanking the pIII signal pepti- 
dase site. 

That completeness is not always essential can be illustrated 
by r\vo examples of results obtained from a library of limited 
diversity. Thus, although the library of Devlin et al. (1990) 
displayed only 1/10^ of the 20'^ possible 15-residue peptides, 
it was successfully employed to (/) identify a monoclonal an- 
tibody (MAb) reactive peptide that, sharing six (of 1 3) residues 
with the native antigen, could be used to induce antigen-reac- 
tive antibodies (Motti et al. 1994) and (//) identify a MAb-re- 
active peptide that, sharing only three residues (over a 
5-residue span) with the herpes simplex virus type 1 (HSV-1) 
native antigen, could when used as an immunogen protect 
mice against subsequent HSV-1 challenge (Schellekens et al. 
1994). Conversely, the importance of a large and diverse li- 
brary is suggested by the finding that, whereas antibody phage 
repertoires of -10^ clones have characteristically yielded anti- 
body fragments of only moderate affinity |iM), a 6.5 x 10'^ 
clone repertoire provided antibodies to a range of antigens and 
haptens with affinities <10 nM (Griffiths et al, 1994). 

That some biological bias exists in RPLs seems clear, as 
indicated by a study by Peters et al. (1994) where it was found 
that 5% of clones in a hexapeptide RPL produced 10- to 10^- 
fold fewer virions than a parental (fd-tet) phage. What remains 
largely unclear (but see Peters et al. 1994) is the nature and 
origin of the bias. 

The issue of position-specific bias against certain residues 
in the early mature region of pi II or pVIII fusion products has 
received little experimental attention. Although, in the single 
study of which we are aware, a careful analysis of 52 randomly 
selected decapeptide library clones found no evidence of 
position-specific bias (DeGraaf et al. 1993), it should be noted 
that random peptides in the library analysed were insulated 
from the pill N-terminus by a relatively lengthy 7-residue 
linker sequence initiated by Ala-Asp (favoured at the N- 
terminus, as discussed earlier). Accordingly, the findings may 
not be broadly representative of other RPLs. 

Crude measures of bias, derived from the overall amino 
acid content of a small number of randomly selected clones, 
suggest that small degrees of bias exist in some RPLs. Possi- 
bly, these reflect greater sequence- or position-specific effects 
or as suggested by one study (Bonnycastle et al. 1996), bias in 
oligonucleotide synthesis. Thus, various studies have reported 
or X" analyses of their published data suggest, that (/) - 1 .8-fold 
more Gly and Lys, and about half the expected Pro were rep- 
resented in one RPL (Cwirla et al. 1990); (//) more Gly and 
less Pro than expected were represented in certain other RPLs 
(Jellis et al. 1993; Burritt et al. 1996); and (;/7) 2.1-fold more 
Gly (Dybwad et al. 1995a) and less than one-third the expected 
Cys (Dybwad et al. 19956) were represented in other RPLs. 

One study in which bias favouring Gly and disfavouring 
Cys were noted (Blond-Elguindi et al. 1993) also found that, 
in a manner consistent with their expected influence on mem- 
brane insertion and signal peptide cleavage, Asp and Glu were 
slightly more abundant towards the N-terminus, the reverse 



being true for Lys. Bias against Cys may be attributable to an 
idea put forth to explain an observed apparent selection against 
odd numbers of Cys residues (ICay et al. 1993); here it was 
reasoned that an unpaired Cys could form a bond with one of 
the eight native pill Cys residues, thus affecting phage infec- 
tivity. 

Representative results: affinity-selection with antibodies 
A straightforward example of a successful RPL application is 
provided in a study (Conley et al. 1994) in which affinity- 
selection from a 1 5-residue RPL of limited diversity with an 
anti-human immunodeficiency virus- 1 MAb yielded 20 MAb- 
reactive clones that could be grouped into the four consensus 
sequences XXDKW, XLD%V^, EXD%W, and ELD%W 
and thus be used to readily identify ELDKW as the native 
epitope. Not all RPL studies yield such apparent success. For 
example, affinity-selection from a hexapeptide RPL with 
MAbs against a-keratin yielded substantial enrichment for tar- 
get clones for only one of seven MAbs (Bottger and Lane 

1994) . Although later studies with five MAbs (Bottger et al. 

1995) had greater success with a 20-residue RPL than with a 
12-residue RPL, the small number of 20-residue sequences 
recovered and their limited homology with the native protein 
were not enough to identify an epitope. Rather than a position- 
specific consensus sequence, a compositional theme emerged 
from a study (Sioud et al. 1994) employing human autoanti- 
bodies against tumor necrosis factor alpha. Thus, 45 of 63 
affinity-selected clones analysed displayed the Ser- and Pro- 
rich sequences ASSLLASSP, NSSPYLNTK, or PQSPGSSFP 
and the remaining 17 clones also contained Ser or Pro. 

Some RPL biopannings may yield misleading results. In 
discussing reasons for this. Smith and Scott (1993) have noted 
two consensus sequences, PWXWL and GDWVFI, that can 
arise from biopanning with both antibodies and other ligates 
and which may bind components of the biopanning system 
(e.g., streptavidin; Roberts et al. 1993). Indeed, phage display 
has been used to identify peptides that bind a number of such 
components, including streptavidin (Devlin et al. 1990; Kay et 
al. 1993), biotin (Saggio an^d Laufer 1993), mouse IgG Fc 
region (Kay et al. 1993), and plastic (Tyr- and Trp-rich se- 
quences; see Adey et al. 1995). One study (Jellis et al. 1993) 
reported enrichment for both MAb-reactive target clones and 
clones bearing a streptavidin-binding motif (HPQN; Devlin et 
al. 1990), while another found that the common blocking agent 
bovine serum albumin could enrich for trypsin-displaying 
phage (Corey et al. 1993). 

Affinity-selection with non-antibody molecules 
As with antibodies, affinity-selection from RPLs with non- 
antibody molecules may yield peptides whose sequences make 
immediate and intuitive sense or which are quite unexpected. 
Thus, two groups (Cheadle et al. 1994; Sparks et al. 1994), 
who independently biopanned RPLs with fusion proteins 
containing the Src SH3 domain, identified similar proline- 
containing sequences related to a known SH3 domain binding 
motif Similarly apparent success, in this case a collection of 
sequences containing the RGD triplet, resulted from affinity- 
selection from 6- and 1 5-residue RPLS with the integrin a,,p3 
(Healy et al. 1995). In contrast, affinity-selection from a 
hexapeptide RPL with S-protein (a fragment of bovine pancre- 
atic ribonuclease) identified a sequence motif, %NF%Vv%, 
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that bore little resemblance to S-protein's natural ligand 
(Smith et al. 1993). An even less sequence-specific and ac- 
cordingly fitting motif was found by affinity-selection from 8- 
and 10-residue RPLs with the molecular chaperone BiP. The 
motif, e%0Xexe (O = a large hydrophobic residue), fits 
welt with the role of BiP in recognizing partially folded 
polypeptides (Blond-Elguindi et al. 1 993). In this latter study, 
the use of ATP to elute target clones by stimulating the ATPase 
activity of BiP is noteworthy. 

Affinity-selection from a 38-residue library with strepta- 
vidin (Kay et al. 1993) illustrates several aspects of the em- 
ployment of RPLs. Of the two classes of sequences recovered, 
one provided a consensus sequence (HP^/j^X, where X = a 
nonpolar residue) and the other provided no clear motif In- 
triguingly, clones of the nonmotif class bound to streptavidin 
with -fivefold greater affinity than selected HP%X clones. 
An advantage of disulfide-constrained display was suggested 
by the finding that the affinity-selected sequence *CHPQAC* 
(* denotes potential disulfide bond partners) bound 100-fold 
less efficiently after treatment with a reducing agent. 

Substrate phage 

Substrate phage is an application of RPLs pioneered (Mat- 
thews and Weils 1993) to identify peptide substrates of pro- 
teases and which relies on the resistance of Ff phage to most 
proteases (Rasched and Oberer 1986). In a study (Mauhews et 
al. 1 994) that illustrates their use, a substrate phage library was 
constructed by inserting a randomized pentapeptide substrate 
fianked by di-alanine residues between a binding domain (a 
variant of human growth hormone; hGH) and pIII. After bind- 
ing the phage to hGH receptor molecules, the library was 
treated with furin to release clones bearing a furin substrate. 
Six rounds of enrichment led to a consensus motif that was 
used as a seed for a second series of randomized libraries. 
Furin selection from these libraries yielded the motif 
%RRF%RP. 

Sttccess and failure 

A theme common to both early (Cwirla et al. 1990; Devlin et 
al. 1990; Scott and Smith 1990) and more recent studies (e.g., 
Conley et al. 1994; Jellis et al. 1993; Kay et al. 1993) is that 
affinity-selection from RPLs can yield related sequences (e.g., 
as reviewed in Cortese et al. 1994, 1995) that as a collection 
provide insight (e.g., identification of residues that may be 
important to ligand-ligate interaction) not obtainable by meth- 
ods (e.g., peptide scanning; Geysen et al. 1987) that lead to a 
single sequence. Yet a number of studies (e.g., Schellekens et 
al. 1994; Bottger and Lane 1994; Bottger et al 1995; Yao et 
al. 1995; Wilson 1997; see also Lane and Stephen 1993) have 
not yielded the desired set of related sequences. 

A number of technical and other reasons for apparent fail- 
ures have been discussed (e.g.. Smith and Scott 1993; Bottger 
and Lane 1994; Clackson and Wells 1994; Scott and Craig 
1994; Schellekens et al. 1994; Bonnycastle et al. 1996). In 
some cases the solution may be straightforward. For example, 
epitopes recognized by some MAbs, such as the linear but 
discontinuous KKXi_4STSXo^HXK variants of a p53 epitope 
(Stephen et al. 1995), may be longer than those provided by 
the 6- to 9-residue libraries often employed, particularly in 
early studies. Thus, it can be hoped that a library of peptides 
of greater length will yield the desired target clones. Such 



libraries are now common. Other, more complex issues relate 
to the inherent diversity of the nature of ligand-ligate interac- 
tions. Accordingly, libraries displaying peptides in different 
structural contexts have been developed; some of these are 
discussed later. 

Given a diversity of options and considering the reported 
failures, a key question for a researcher considering phage 
display for an application such as epitope identification is 
"what ensures success?" and an important task has thus been 
to extract the rules that govern this. For RPL applications 
involving antibodies, an important paper relevant to this task 
is that describing an extensive study (Bonnycastle et al. 1996) 
designed to examine the interplay among the natures of (/) the 
immunogen (peptide, protein, or carbohydrate), (/;) the epitope 
(linear or discontinuous), (///) the antibody (monoclonal or 
polyclonal), and (/v) the phage-displayed peptide. Affinity-se- 
lection from 1 1 RPLs (displaying peptides of varying length 
in different structural contexts) with 17 antibodies led to the 
findings that (/) for a given antibody, not all libraries yielded 
a consensus sequence, (//) in some cases, different libraries 
provided different consensus sequences or peptides with no 
apparent consensus, (//7) for most MAbs, the tightest-binding 
peptides were identified from a few libraries, and (/v) the li- 
braries yielding these tight-binding peptides varied among 
MAbs (Bonnycastle et al. 1996). Thus, the diversity of factors 
involved in molecular interactions is reflected, in practical 
terms, in an inherent unpredictability. 

Yet importantly, it was also found that, given an adequate 
set of libraries, reactive peptides could always be identified for 
antibodies elicited against peptides, linear epitopes of proteins, 
and carbohydrates but not always for antibodies against dis- 
continuous epitopes of proteins. Thus, except for these latter 
epitopes, success can be assured, but it may require employ- 
ment of multiple libraries. Although this is not so straightfor- 
ward as may have been hoped for from the apparent simplicity 
of early phage display papers, perserverance may be suffi- 
ciently rewarding. For example, in a preface to two papers 
(Livnah et al. 1996; Wrighton et al. 1996) describing a peptide 
identified from phage libraries, Wells (1996) commented that 
the findings were "enough to reinstate one's belief in Santa 
Claus," for the authors had found a peptide able to form dimers 
and subsequently dimerize and activate the erythropoietin re- 
ceptor. Described by Wells (1996) as "try everything you can 
think of," the successful strategy considered the relationships 
among such issues as methodology, affinity, and avidity and 
employed a number of pVIlI and pIII libraries, as well as a 
variety of elution methods. 

A final issue concerns our definition of success, which (in 
this review) has thus far required that phage libraries yield (/) 
sequences that correspond in some way with a native ligand 
or ligate (e.g., an epitope) or (//) one or more collections of 
sequences for which consensus sequence(s) can be readily 
defined, but where there is no obvious homology to the native 
ligand or ligate (e.g., one or more mimotopes). This require- 
ment derives from the idea that interactions are determined by 
molecular fit between ligand and ligate, such as by comple- 
mentarity of shape and charge and hydrophilicity. Importantly, 
however, it has been found that interactions can also be medi- 
ated by a relatively large number of ordered water molecules 
at the ligand-ligate interface (e.g., Bhat et al. 1994). In such 
cases, where a molecular layer bridging ligand and ligate adds 
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a new level of complexity, it is conceivable that phage libraries 
will deliver a greater assortment of sequences, sufficiently 
diverse that success (now redefined in tenns of finding reac- 
tive versus related sequences) may not be recognized as such. 

Phage-displayed proteins and alternative methods of 
display 

Phage display can be classified (Smith 1993) as Types 3, 3+3, 
33, 8, 8+8, and 88, depending on the use of pIII or pVIII as the 
carrier molecule, the number of ftision products, and the vec- 
tors employed. 

In the Type 3 systems of the first RPLs, a glll cloning site 
in the phage genome enables display of a foreign peptide on 
each copy of pIII (Figs. IC and ID). In Type 3+3 display, 
recombinant glll is encoded on a phagemid (a plasmid con- 
taining the Ff origins of replication and packaging signal). 
Here, superinfection of phagemid-harboring cells by a helper 
phage (an Ff phage with a defective packaging signal and 
interference-resistance mutation(s)) yields viral particles con- 
taining phagemid ss DNA and comprised of both wild-type 
and recombinant pIIL In Type 33 display, a single phage 
genome encodes both wild-type and recombinant glll, the lat- 
ter often being inserted into the multiple cloning site of an M 1 3 
cloning vector. 

The relative merits of the pIII systems can be illustrated by 
three issues. The first, an issue in methodology, concerns in- 
trinsic affinity versus functional avidity. Here it is reasoned 
(discussed in Wells and Lowman 1992) that target peptides 
affinity-selected from Type 3+3 PDLs will possess greater 
intrinsic affinity for their cognate molecule than those selected 
from Type 3 libraries where high fiinctional avidity may derive 
from multiple low-affinity interactions. A second issue con- 
cerns the potential interference of fusions to pIII with its 
F-pilus adsorption function and consequent effects on phage 
infectivity. Here, interference can be minimized by Types 33 
and 3+3 display, where wild-type pIII is incorporated into the 
viral particle. A third issue also concerns biology. Here it is 
hoped that reducing the level of expression of an intrinsically 
toxic or difficult-to-export pIII fiision protein will facilitate its 
display (e.g.. Smith 1993; Cortese et al. 1994). Thus, larger 
polypeptides and proteins are often displayed with Types 33 
and 3+3 vectors and for similar reasons low copy number 
vectors (fd-tet derivatives) may be beneficial in Type 3 display 
(Smith and Scott 1993). Although these approaches have in- 
tuitive merit, their general effectiveness has not been system- 
atically addressed. Thus, success may vary from protein to 
protein and, in practical terms, will be readily determinable in 
applications involving display of a single protein or small 
number of variants but more difficult to assess for a genomic 
or other diverse library. 

Types 8, 8+8, and 88 vectors are the pVIII counterparts to 
the Types 3, 3+3, and 33 vectors. Although Type 8 display is 
uncommon because of the difficulties, noted eariier, in dis- 
playing peptides longer than five or six residues, the recent 
generation of organic landscapes on phage using Type 8 dis- 
play of octapeptides (Petrenko et al. 1996) points to their util- 
ity. Proteins as large as antibody Fab fragments have been 
displayed with Types 8+8 (Kang et al. 1991) and 88 (Huse et 
al. 1992) vectors. Importantly, like Type 3+3 vectors, Types 
8+8 and 88 vectors can be employed for display in which 
ligand-ligate interactions are locally monovalent and may 



possess certain advantages over Type 3+3 vectors (Bonnv- 
castleetal. 1996). 

Type 3 display 

Catalytically active E. coli alkaline phosphatase (AP) has been 
displayed in this way and, notably, catalytic activity required 
dimerization of AP-pIII on the phage surface (McCafferty et 
al. 1991). However, infectivity of the AP-pIII recombinants 
was <10% of that of the parent vector and proteolysis of the 
AP-pIII fusions was considerable. Only 5-10% of the fusions 
were intact in early preparations, 30-60% in later preparations. 
Similarly, in phage display of the 32-kDa B chain of ricin, the 
ratio of intact to proteolyzed B chain - pIII fiision products 
was 1 :3 (Swimmer et al. 1992). Proteolysis was also suspected 
in Type 3 display of the 58-residue bovine pancreatic trypsin 
inhibitor (Roberts et al. 19926). In contrast, no significant 
degradation was seen in display of fiilly active p-lactamase 
(Soumillion et al, 1994). That human interieukin 3 (hlL3) re- 
combinants, displayed in a T>pe 3 vector (Gram et al. 1993), 
could be identified on the basis of their distinct small plaque 
phenotype points to an interplay between hIL3-pIII fusion 
protein and phage-host biology. 

Type i+i display 

Among early phage display papers was that describing 
phagemid display of the 191-residue disulfide-containing 
hGH, accomplished by fusion of hCH to the plll C-terminal 
domain (pliy required for virion morphogenesis (Bass et al. 
1990). Importantly, hGH-pIIlc could bind to hGH receptor 
and was recognized by MAbs whose epitopes are sensitive to 
hGH conformation and immunoblots showed little evidence of 
hGH-pIIIc degradation. Although phagemid titers were ex- 
pectedly sensitive to the fraction of recombinant pill mole- 
cules incorporated, hGH-pIIlc expression could be limited so 
that few phagemid particles displayed more than a single 
hGH-pIII^ fusion protein. Other examples of Type 3+3 pIII^ 
fusions include (/) the a subunit of the high-affmity receptor 
for IgE (FceRI) in a form recognized by IgE (Scarselli et al. 

1993) and (//) the 23-kDa human ciliary neurotrophic factor in 
a form that bound to its natural receptor (Saggio et al. 1995). 
Degradation of the pIII^ fusion proteins was not examined in 
these latter papers. 

Other methods of Type 3+3 display involve fusions to com- 
plete mature pIII molecules. One such study involved display 
of the 33-kpa prostate specific antigen (PSA) in a form detect- 
able by antibodies recognizing nonlinear epitopes (Eerola et 
al. 1994). Here, Western blotting showed degradation of many 
of the PSA-pIII fusion products. Staphylococcal protein A 
(SPA) has been displayed in this manner (Kushwaha et al. 

1994) and it appears that most if not all of the phagemid-dis- 
played SPA was of the expected size. Both Types 3 and 3+3 
display of residues 1-176 of human CD4 produced 
CD4,_i76-pIII fusions in a form recognized by gpl20 of the 
human immunodeficiency virus (Abrol et al. 1994). Published 
Western blots hint to proteolysis of some Type 3+3 
CD4,_j76-pIII fusions. Infectivity of the Type 3 recombinants 
was reduced to 5% of that of the parent vector. 

Types 33 and 88 display 

In a study exploring both Types 33 and 88 display, sequences 
encoding a signal peptide and trypsin fused to pIII^ or mature 
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pVIII cloned into the M13mpl8 yielded virions comprised of 
both recombinant and wild-type coat protein (Corey et al. 
1993). Interestingly, the endogenous E. coli protease inhibitor 
ecotin copurified with the tryspin-displaying virions and, im- 
portantly, trypsin-coat protein fusions possessed near wild- 
type trypsin activity. However, that <20% of virions displayed 
the fusions and mean infectivity (averaged for wild-type and 
recombinant virions) was <25% of that of the nonrecombinant 
vector suggest profound effects of the fiisions on some aspect 
of host and phage biology. 

The construction and employment of a Type 88 vector to 
display bovine pancreatic trypsin inhibitor (BPTI) illustrates 
several aspects of vector design and host biology. In a manner 
similar to that employed for trypsin display, sequences encod- 
ing BPTI-pVIII fusions were cloned into M13mpl8 
(Markland et al. 1991). Homologous recombination among 
recombinant and wild-type gVIII was avoided by the use of 
alternative codons to construct a synthetic gVIII for the BPTI 
fusions. The first constructs, encoding a fusion of the pVIII 
signal peptide, BPTI, and mature pVIII, produced no recom- 
binant virions. Rather, Western blots of host cell lysates 
showed only a single pVIII-related protein with a size charac- 
teristic of unprocessed fusion protein. Use of an alternative 
signal sequence led to partial processing and reasonable levels 
of BPTI-pVIlI incorporation into virions (<2% of total pVIlI) 
were eventually achieved by use of a pHA {secY) host. Replac- 
ing the ATG start codon of wild-type gVIII with CTG in- 
creased the fraction of BPTI-pVlII incorporated into virions 
to -3% but reduced virion yields 10-foId. 

Type 8+8 display 

An early innovation involving Type 8+8 display is provided 
in a study (Kang et al. 1991; see also Barbas et al. 1991) that 
employed a phagemid to coexpress (/) antibody heavy chains 
(Vh-C^]) as fusions between a PelB signal peptide and pVIII 
and (//) light chains (Vl-Cl) as fusions to a second PelB signal 
peptide. The expectation that Vh-Chi-pVIII and Vl-Q mole- 
cules would associate in the periplasm to form properly folded 
Fab fragments were validated by studies including electron 
microscopy of phagemids decorated with antigen-Fab com- 
plexes. 



Directed evolution of a protein 

Phage display is popular for in vitro selection of peptides and 
proteins with altered properties, as suggested by the subhead- 
ing "phage fever" in a review of the field (Clackson and Wells 
1994). An early study, termed "directed evolution of a protein" 
(Roberts et al. 1992o), illustrates the approach. Here, a 1000- 
member PDL constructed by limited randomization of five 
residue positions of BPTI was biopanned with immobilized 
human neutrophil elastase (HNE) to yield a variant with >10^- 
fold greater affinity for HNE than the parental BPTI sequence. 
Other studies, of^en with multiple libraries of greater diversity, 
have similarly yielded target clones with altered target speci- 
ficity. These include not only other protease inhibitors (Dennis 
and Lazarus 1994a, 1994A; Wang et al. 1995a) but also an 
enzyme with altered specificity for active-site ligands (Wider- 
sten and Mannervik 1995) and zinc fingers with modified 
specificity in one or more fingers (Choo and Klug 1995). 



Gene fragment libraries 

Although the first PDL (Smith 1985) was a gene fragment 
library, studies employing gene fragment and genomic PDLs 
appear to be mainly limited to the following examples. 

Multivalent plll-display gem fragment libraries screened 
with antibodies 

In an early application of gene fragment libraries (Bleul et al, 
1991), a Type 3 library constructed with 200-bp DNase I gen- 
erated fragments of subgenomic DNA of the human papilloma 
virus type 1 8 (HP V- 1 8) was screened using plaque lifts (rather 
than biopanned) to identify two unique antibody-reactive pep- 
tides for each of two a-HPV-l8 sera. Similarly, a relatively 
small (200 transformants) Type 3 library constructed with 100- 
to 200-bp fragments of the gene encoding the bluetongue virus 
capsid protein VP5 has been employed to identify, by affinity- 
selection, two overiapping (44- and 50-residue) peptides reac- 
tive with a-VP5 antibody (Wang et al. 19956). 

A more extensive study involved Type 3 libraries con- 
structed with 50- to 200-bp gene fragments encoding peptides 
for (/■) Drosophila RNA polymerase II, (//) human 53 protein, 
and (//7) human cytokeratin 19 protein (Petersen et al. 1995). 
Epitopes recognized by MAbs raised against these proteins 
were identified after one or two rounds of affinity-selection. 
Interestingly, of the three cytokeratin 19 libraries constructed, 
only one yielded target clones; here, Cys-containing linkers 
had been used to present peptides in a disulfide-constrained 
loop. 

Identification ofligand-binding domains with a phagemid 
system 

Type 3+3 display has also been employed for gene fragment 
libraries. In a model system exploring the feasibility of pro- 
karyotic genomic libraries, a phagemid-based library con- 
structed with 100- to 700-bp sonication products of 
Staphylococcus aureus genomic DNA was biopanned with 
human IgG and fibronectin and subsequently screened by col- 
ony blotting and hybridization with oligonucleotide probes to 
identify known binding regions of staphylococcal protein A 
and fibronectin-binding proteins (Jacobsson and Frykberg 

1995) . Curiously, +1 or -1 frameshifts were found in the 
cloned inserts of all five fibronectin-binding clones and in 
related work, 47 of 50 clones derived from other libraries 
possessed similar frameshifts. Frameshifts were also noted in 
studies employing Type 8-t-8 display (Jacobsson and Frykberg 

1996) . 

cDNA libraries by means of Jun and Fos leucine zippers 
In a manner similar to an approach described earlier for Fab 
display (Barbas et al. 1991; Kang et al. 1991), an innovative 
phagemid system for display of cDNA fragments exploits the 
interaction of Jun and Fos leucine zippers (Crameri and Suter 
1 993; Crameri et al. 1 994). Here, expression of pelB-jun-glllc 
and pelB-fos-cDN A fusions and the resulting intraperiplasmic 
high-affinity Fos-Jun interactions (and subsequent covalent 
linkage mediated by engineered disulfide bonds) allows dis- 
play of cDNA-derived peptides on a phagemid surface. Impor- 
tantly, display involves C-terminal fusions to Fos rather than 
insertions near the pIII N-terminus. Thus, (/) cDNA or other 
gene fragments containing translation stop sites or other non- 
coding sequences can be expressed and (//) the N-terminal 
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mature region of the fusion proteins, believed important in 
membrane translocation of proteins, is constant among library 
members. 

Affinity-selection from gene fragment libraries 
The heterologous nature (in size and sequence) of displayed 
peptides sets gene fragment libraries apart from other PDLs, 
which characteristically display (/) peptides of enormous di- 
versity but of small size, as in RPLs or {//) large peptides or 
proteins in which a small number of residues are varied within 
an otherwise uniform scaffold, as in antibody display. Since 
clonal competition may thus prove to be relatively greater in gene 
fragment PDLs, conventional methods of affinity-selection, 
which include library amplification between multiple rounds 
of enrichment, may be inappropriate. 

In this context, it may be relevant that several applications 
of gene fragment libraries reviewed above (/) avoided affinity- 
enrichment and instead used direct screening by plaque lifts 
(Bleul et al. 1991) or (//) employed only one or two rounds 
(Petersen et al. 1995) of affinity-enrichment followed by 
screening. In studies that employed PDLs constructed with 
DNase I generated fragments of the gene encoding B. pertussis 
filamentous hemagglutinin, we similarly employed a single 
affinity-enrichment followed by screening by plaque lifts 
(Wilson 1997; Wilson et al. 1997). As we found in follow-up 
studies, clonal competition during library amplification does 
favour growth of nonrecombinants or other clones not recog- 
nized by antibody but not to the extent of significantly negat- 
ing the gains from enrichment (Wilson 1997). Thus, our 
caution may have been unnecessary. However, in studies of 
other gene fragment libraries, virion yields were found to vary 
100-fold among clones, in some cases 10''- fold (Wilson 1997). 
With the latter libaries, clonal competition would expectedly 
be an issue. 



Innovations in peptide presentation 

Several methods have been developed to display peptides in a 
constrained, often defined, confonnation. The simplest of 
these employs a disulfide-bonded loop. Other methods involve 
peptide display on a relatively larger molecular scaffold. 

Disulfide-constrained peptides 

Because virion morphogenesis involves insertion of pIII and 
pVIII into the inner membrane with their N-termini within the 
periplasm, paired cysteines flanking randomized sequences 
(e.g., *CX6C*, where * denotes disulfide bond partners) can 
be used to present peptides in a disulfide-bonded loop (Luz- 
zago et al. 1993; McLafferty et al. 1993; Zhong et al. 1994). 
Cys-constrained RPLs have been employed with remarkable 
success. The earlier-described identification of a 20-residue 
disulfide-constrained peptide capable of activating the 
erythropoietin receptor is one example. The isolation of brain- 
and kidney-targeting phage after intravenous injection of Cys- 
constrained RPLs into mice (Pasqualini and Ruoslahti 1996) 
is another novel illustration. 

The relative merits of disulfide constraint can be illus- 
trated by more conventional studies. In one application, a di- 
sulfide bond was required for high-affinity binding of 
MAb-reactive peptides identified by affinity-selection from 



a X*CX4C*X RPL (McLafferty et al. 1993). In another, a 
disulfide bridge was required for binding of affinity-selected 
clones to a conformafion-dependent MAb but not for bind- 
ing to nonconformation-dependent MAbs (Zhong et al. 1994). 
The latter study also found that Cys-constrained peptides pos- 
sessed greater antigenic fitness (the ability of a peptide to elicit 
an immune response against the native antigen) than uncon- 
strained peptides. 

In contrast, work that involved affinity-selection from both 
unconstrained (Xg) and constrained {^CX^C*) libraries 
(McConnell et al. 1 994) found that while the Xg library yielded 
peptides with residues (VYIH) corresponding to those of the 
native antigen, the ^CX^C* library yielded only a mimotope 
sequence (DWWH). Moreover, follow-up studies showed that 
YVIH, when constrained within a disulfide-bonded loop, pos- 
sessed considerably reduced affinity for the cognate MAb. 
Finally, a further study (McConnell and Hoess 1995) identi- 
fied the MAb-reactive consensus sequence LEPW from both 
constrained and unconstrained RPLs. Notably, Bonnycastle et 
al. (1996) have found that the preference of an antibody for a 
specific type of disulfide constraint, or no constraint at all, is 
essentially unpredictable. 

In reviewing constrained display, Ladner (1995) has sug- 
gested that increasing the disulfide-imposed constraint im- 
posed on a peptide segment (e.g., RGD in XXXRGDXXX, 
*CXXRGDXXC*, and *CX*CRGDC*XC*) decreases the 
likelihood that it will bind to any particular target, but if a 
target is found, the peptide will bind more tightly and specifi- 
cally. 



Presentation on carrier molecules 

Tendamistat, a 74 amino acid molecule comprised of two 
three-strand P-sheets, has been employed as a scaffold in 
which two adjacent strand-connecting loops (13 and 6 resi- 
dues) can be randomized (McConnell and Hoess 1995) in 
Type 33 display. An expected advantage of constrained dis- 
play involving two (versus one) randomized loops is the better 
approximation of surface structures involved in intermolecular 
recognition. The 61 -residue minibody (Pessi et al. 1993) is 
similarly comprised of two three-strand p-sheets and retains 
the regions corresponding to two adjacent hypervariable loops 
(HI and H2) of the antibody domain from which it was 
engineered. Its utility was demonstrated (Martin et al. 1994) 
by the identification from a Type 3 library of a variant whose 
H2 region shared homology with a portion of the IL-6Ra 
receptor and which inhibited the biological activity of human 
interIeukin-6. A more potent variant was identified by Type 
3+3 minibody display in which the HI loop of the previously 
identified H2 variant was randomized (Martin et al. 1996). 

Phage display of randomized a-heiices has been explored 
in a study (Bianchi et al. 1995) in which five helix positions 
(that are most exposed in the helix and cluster on one side) of 
a 26-residue zinc finger were randomized. Notably, the cou- 
pling of zinc coordination and folding in zinc fingers allows 
zinc-dependent binding to be used as a control against struc- 
turally undefined sequences. Phage display involving random- 
izing residues of a surface comprised of two a-helices of a 
monovalent Fc-binding domain of staphylococcal protein A 
has also been explored (Nord et al. 1995, 1997). 
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Innovations in identification of target clones 

Several methods to identify target clones exploit the periplas- 
mic association of separately translocated molecules (re- 
viewed earlier) to fomi a complex that becomes incorporated 
into a virion. The innovation lies in the physical division of 
pIII into its two functional domains, the N-terminal (pIIIn) 
adsorption domain, and the C-terminal (plllc) morphogenetic 
domain. 

In the direct interaction rescue system (Gramatikoff et al. 
1994, 1995), an invariant bait molecule such as c-Jun is cloned 
as an N-tenninal fusion to plll^ and variant fragments such as 
those from a human cDNA library are cloned as C-terminal 
fusions to pIIIn- In cells harbouring a cDNA fragment encod- 
ing a peptide that interacts with the Jun bait (Fos was the target 
in the model system described), periplasmic association of 
Jun~pIII(^ and pIIIj,j-cDNA fusion products yields infectious 
virions constructed with the heterodimeric complexes; in other 
cells, noninfectious virions containing only Jun-pIII^ are pro- 
duced. The similar selectively-infective phage system (Kreb- 
ber et al. 1995) employs single-chain antibodies (scFv)-pIIIc 
and pllljj-antigen fusions. Infectivity is restored by cognate 
interactions between plllisj-antigen and scFv-pIIIc. Notably, 
antigen or antibody sequences can be randomized or otherwise 
varied. A similar system has been developed (Duenas and 
Borrebaeck 1994) to mimic clonal selection by the immune 
system. 

Concluding remarks 

Ff phage display has been notably successful, even to the ex- 
tent of enabling research into questions not readily answerable 
by other means. Yet not all ventures in this evolving field have 
been (entirely) successful. Among the reasons for this, as we 
have argued, are that phage and host cell biology may impose 
greater limitations than we initially may have hoped for. 

In part because of such limitations, promising systems em- 
ploying lytic phage such as X and T4 have been developed 
(Maruyama et al. 1994; Dunn 1995; Efimov et al. 1995; Stem- 
berg and Hoess 1995; Mikawa et al. 1996; see also Cortese et 
al. 1 996). The seemingly clear advantage to lytic phage display 
is that incorporation of recombinant proteins into virion coats 
or tail proteins does not require the host's membrane translo- 
cation apparatus or other aspects of membrane-mediated as- 
sembly and export. Nevertheless, lytic phage display will 
likely prove to have its own characteristic shortcomings and 
Ff and lytic phage display can thus be expected to provide 
complementary and increasingly useful tools for research. 

At present we have only a largely general understanding of 
the ways in which Ff phage and host biology must (in princi- 
ple) and do (in experience) impose limitations on successful 
phage display. Hopefully, fijture studies will provide greater 
insight into the ways in which a displayed peptide or protein 
influences its ability to be successfully displayed for, as our 
understanding increases, so will the utility of phage display. 
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